The Utility of Randomness in Decision Tree Ensembles
نویسندگان
چکیده
The use of randomness in constructing decision tree ensembles has drawn much attention in the machine learning community. In general, ensembles introduce randomness to generate diverse trees and in turn they enhance ensembles’ predictive accuracy. Examples of such ensembles are Bagging, Random Forests and Random Decision Tree. In the past, most of the random tree ensembles inject various kinds of randomness into deterministic models. Very few of these ensembles considered variable randomness or found it insensitive to performance. In contrast, this thesis uses complete-random tree ensembles as a starting point to investigate the utility of randomness and finishes with a variable random model, capable of finding the appropriate settings of randomness for individual data sets in order to improve predictive accuracy. Firstly, we construct a taxonomy of tree randomisations to categorise existing randomisation techniques. Then, we analyse the benefits and problems of different randomisation techniques to gain a better understanding of their effects. Secondly, we find that the key component of random tree ensembles is simply the probability averaging ensemble method. Based on the results of vigorous experimentations, probability averaging brings out the best of complete-randomness in decision tree ensembles. Using this key component alone permits the highest degree of diversity. We name this complete-random tree algorithm Max-diverse Ensemble as it achieves exceptional accuracy by maximising diversity. Interestingly, without the presence of any feature selection criterion, Max-diverse Ensemble’s accuracy is comparable to Random Forests, a popular implementation of random tree ensemble. Furthermore, visual evidence shows that complete-randomness provides a distinctive representational power to model target concepts. Taking the advantages of this representational power, we propose a decision tree algorithm with variable randomness. It is called Max-diverse.α. Max-diverse.α forms a smooth convex error-rate contour using different degrees of randomness. In many cases, individual data sets with appropriate settings of randomness achieve better accuracy than the complete-random settings. Finally, we propose a simple estimation technique for estimating an effective settings of randomness generated entirely from the progressive training errors. Applying the estimation technique, Max-diverse.α improves significantly from Max-diverse Ensemble. The experimental results show that Max-diverse.α performs significantly better than Random Forests and comparably with the state-of-the-art C5 Boosting.
منابع مشابه
Biological Image Classification with Random Subwindows and Extra-Trees
We illustrate the potential of our image classification method on three datasets of images at different imaging modalities/scales, from subcellular locations up to human body regions. The method is based on random subwindows extraction and the combination of their classification using ensembles of extremely randomized decision trees.
متن کاملSpectrum of Variable-Random Trees
In this paper, we show that a continuous spectrum of randomisation exists, in which most existing tree randomisations are only operating around the two ends of the spectrum. That leaves a huge part of the spectrum largely unexplored. We propose a base learner VR-Tree which generates trees with variable-randomness. VR-Trees are able to span from the conventional deterministic trees to the comple...
متن کاملLearning with Ensembles of Randomized Trees : New Insights
Ensemble of randomized trees such as Random Forests are among the most popular tools used in machine learning learning and data mining. Such algorithms work by introducing randomness in the induction of several decision trees before employing a voting scheme to give a prediction for unseen instances. In this paper, randomized trees ensembles are studied in the point of view of the basis functio...
متن کاملEnsembles of randomized trees using diverse distributed representations of clinical events
BACKGROUND Learning deep representations of clinical events based on their distributions in electronic health records has been shown to allow for subsequent training of higher-performing predictive models compared to the use of shallow, count-based representations. The predictive performance may be further improved by utilizing multiple representations of the same events, which can be obtained ...
متن کاملRisk premiums and certainty equivalents of loss-averse newsvendors of bounded utility
Loss-averse behavior makes the newsvendors avoid the losses more than seeking the probable gains as the losses have more psychological impact on the newsvendor than the gains. In economics and decision theory, the classical newsvendor models treat losses and gains equally likely, by disregarding the expected utility when the newsvendor is loss-averse. Moreover, the use of unbounded utility to m...
متن کامل